Goto

Collaborating Authors

 Sea of Okhotsk


A Self-Evolving AI Agent System for Climate Science

Guo, Zijie, Wang, Jiong, Ling, Fenghua, Wei, Wangxu, Yue, Xiaoyu, Jiang, Zhe, Xu, Wanghan, Luo, Jing-Jia, Cheng, Lijing, Ham, Yoo-Geun, Song, Fengfei, Gentine, Pierre, Yamagata, Toshio, Fei, Ben, Zhang, Wenlong, Gu, Xinyu, Li, Chao, Wang, Yaqiang, Chen, Tao, Ouyang, Wanli, Zhou, Bowen, Bai, Lei

arXiv.org Artificial Intelligence

Scientific progress in Earth science depends on integrating data across the planet's interconnected spheres. However, the accelerating volume and fragmentation of multi-sphere knowledge and data have surpassed human analytical capacity. This creates a major bottleneck for discovery, especially in climate science. To address this challenge, we introduce EarthLink, the first self-evolving AI agent system designed as an interactive "copilot" for Earth scientists. Through natural language interaction, EarthLink automates the entire research workflow by integrating planning, code execution, data analysis, and physical reasoning into a unified process that directly addresses this limitation. Beyond efficiency, it exhibits human-like cross-disciplinary analytical ability and achieves proficiency comparable to a junior researcher in expert evaluations on core large-scale climate tasks, including model-observation comparison and climate change understanding. When tasked with an open scientific problem, specifically the discovery of precursors of the Atlantic Niño, EarthLink autonomously developed a research strategy, identified sources of predictability, verified its hypotheses with available data, and proposed a physically consistent mechanism. These emerging capabilities enable a new human-AI research paradigm. Scientists can focus on value and result judgments, while AI systems handle complex data analysis and knowledge integration. This accelerates the pace and breadth of discovery in Earth sciences. The system is accessible at our website https://earthlink.intern-ai.org.cn.


StereoTacTip: Vision-based Tactile Sensing with Biomimetic Skin-Marker Arrangements

Lu, Chenghua, Tang, Kailuan, Hui, Xueming, Li, Haoran, Nam, Saekwang, Lepora, Nathan F.

arXiv.org Artificial Intelligence

Chenghua Lu received the B.S. degree in Mechanical Engineering from Northeastern University, Shenyang, China, in 2017, and the M.S. degree in Mechanical Manufacturing and Automation from the University of Chinese Academy of Sciences, Beijing, China, in 2021. She is currently working toward the Ph.D. degree majoring in Engineering Mathematics with the School of Mathematics Engineering and Technology and Bristol Robotics Laboratory, University of Bristol, Bristol, UK. Her research interests include tactile sensing and soft robotics. Kailuan T ang received a B.S. degree in Communication Engineering from the Southern University of Science and Technology (SUSTech), Shenzhen, China in 2017. He is currently working towards a Ph.D. degree majoring in Mechanics with the School of Mechatronics Engineering, Harbin Institute of Technology.


GraphDOP: Towards skilful data-driven medium-range weather forecasts learnt and initialised directly from observations

Alexe, Mihai, Boucher, Eulalie, Lean, Peter, Pinnington, Ewan, Laloyaux, Patrick, McNally, Anthony, Lang, Simon, Chantry, Matthew, Burrows, Chris, Chrust, Marcin, Pinault, Florian, Villeneuve, Ethel, Bormann, Niels, Healy, Sean

arXiv.org Artificial Intelligence

We introduce GraphDOP, a new data-driven, end-to-end forecast system developed at the European Centre for Medium-Range Weather Forecasts (ECMWF) that is trained and initialised exclusively from Earth System observations, with no physics-based (re)analysis inputs or feedbacks. GraphDOP learns the correlations between observed quantities - such as brightness temperatures from polar orbiters and geostationary satellites - and geophysical quantities of interest (that are measured by conventional observations), to form a coherent latent representation of Earth System state dynamics and physical processes, and is capable of producing skilful predictions of relevant weather parameters up to five days into the future.


Vision-Language Models Meet Meteorology: Developing Models for Extreme Weather Events Detection with Heatmaps

Chen, Jian, Zhou, Peilin, Hua, Yining, Chong, Dading, Cao, Meng, Li, Yaowei, Yuan, Zixuan, Zhu, Bing, Liang, Junwei

arXiv.org Artificial Intelligence

Real-time detection and prediction of extreme weather protect human lives and infrastructure. Traditional methods rely on numerical threshold setting and manual interpretation of weather heatmaps with Geographic Information Systems (GIS), which can be slow and error-prone. Our research redefines Extreme Weather Events Detection (EWED) by framing it as a Visual Question Answering (VQA) problem, thereby introducing a more precise and automated solution. Leveraging Vision-Language Models (VLM) to simultaneously process visual and textual data, we offer an effective aid to enhance the analysis process of weather heatmaps. Our initial assessment of general-purpose VLMs (e.g., GPT-4-Vision) on EWED revealed poor performance, characterized by low accuracy and frequent hallucinations due to inadequate color differentiation and insufficient meteorological knowledge. To address these challenges, we introduce ClimateIQA, the first meteorological VQA dataset, which includes 8,760 wind gust heatmaps and 254,040 question-answer pairs covering four question types, both generated from the latest climate reanalysis data. We also propose Sparse Position and Outline Tracking (SPOT), an innovative technique that leverages OpenCV and K-Means clustering to capture and depict color contours in heatmaps, providing ClimateIQA with more accurate color spatial location information. Finally, we present Climate-Zoo, the first meteorological VLM collection, which adapts VLMs to meteorological applications using the ClimateIQA dataset. Experiment results demonstrate that models from Climate-Zoo substantially outperform state-of-the-art general VLMs, achieving an accuracy increase from 0% to over 90% in EWED verification. The datasets and models in this study are publicly available for future climate science research: https://github.com/AlexJJJChen/Climate-Zoo.


Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration

Mao, Xin, Li, Feng-Lin, Xu, Huimin, Zhang, Wei, Luu, Anh Tuan

arXiv.org Artificial Intelligence

While Reinforcement Learning from Human Feedback (RLHF) significantly enhances the generation quality of Large Language Models (LLMs), recent studies have raised concerns regarding the complexity and instability associated with the Proximal Policy Optimization (PPO) algorithm, proposing a series of order-based calibration methods as viable alternatives. This paper delves further into current order-based methods, examining their inefficiencies in utilizing reward values and addressing misalignment issues. Building upon these findings, we propose a novel \textbf{V}alue-based \textbf{C}ali\textbf{B}ration (VCB) method to better align LLMs with human preferences. Experimental results demonstrate that VCB surpasses existing alignment methods on AI assistant and summarization datasets, providing impressive generalizability, robustness, and stability in diverse settings.


Topic-Selective Graph Network for Topic-Focused Summarization

Zesheng, Shi, Yucheng, Zhou

arXiv.org Artificial Intelligence

Due to the success of the pre-trained language model (PLM), existing PLM-based summarization models show their powerful generative capability. However, these models are trained on general-purpose summarization datasets, leading to generated summaries failing to satisfy the needs of different readers. To generate summaries with topics, many efforts have been made on topic-focused summarization. However, these works generate a summary only guided by a prompt comprising topic words. Despite their success, these methods still ignore the disturbance of sentences with non-relevant topics and only conduct cross-interaction between tokens by attention module. To address this issue, we propose a topic-arc recognition objective and topic-selective graph network. First, the topic-arc recognition objective is used to model training, which endows the capability to discriminate topics for the model. Moreover, the topic-selective graph network can conduct topic-guided cross-interaction on sentences based on the results of topic-arc recognition. In the experiments, we conduct extensive evaluations on NEWTS and COVIDET datasets. Results show that our methods achieve state-of-the-art performance.


Open AI gets GPT-3 to work by hiring an army of humans to fix GPT's bad answers. Interesting questions involving the mix of humans and computer algorithms in Open AI's GPT-3 program

#artificialintelligence

The InstructGPT research did recruit 40 contracters to generate a dataset that GPT-3 was then fine-tuned on. But I [Quach] don't think those contractors are employed on an ongoing process to edit responses generated by the model. A spokesperson from the company just confirmed to me: "OpenAI does not hire copywriters to edit generated answers," so I don't think the claims are correct." So the above post was misleading. I'd originally titled it, "Open AI gets GPT-3 to work by hiring an army of humans to fix GPT's bad answers." I changed it to "Interesting questions involving the mix of humans and computer algorithms in Open AI's GPT-3 program." I appreciate all the helpful comments! Stochastic algorithms are hard to understand, especially when they include tuning parameters. I'd still like to know whassup with Google's LaMDA chatbot (see item 2 in this post).


Chatbots: Still Dumb After All These Years

#artificialintelligence

In 1970, Marvin Minsky, recipient of the Turing Award ("the Nobel Prize of Computing"), predicted that within "three to eight years we will have a machine with the general intelligence of an average human being." The fundamental roadblock is that, although computer algorithms are really, really good at identifying statistical patterns, they have no way of knowing what these patterns mean because they are confined to MathWorld and never experience the real world. It's a brown-throated thrush, but in Germany it's called a halsenflugel, and in Chinese they call it a chung ling and even if you know all those names for it, you still know nothing about the bird–you only know something about people; what they call that bird. Now that thrush sings, and teaches its young to fly, and flies so many miles away during the summer across the country, and nobody knows how it finds its way," and so forth. There is a difference between the name of the thing and what goes on.